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Abstract 

A search for a massive resonance decaying into a standard-model-like Higgs boson 
(H) and a W or Z boson is reported. The analysis is performed on a data sample cor¬ 
responding to an integrated luminosity of 19.7 fb 1 , collected in proton-proton colli¬ 
sions at a centre-of-mass energy of 8 TeV with the CMS detector at the LHC. Signal 
events, in which the decay products of Higgs, W, or Z bosons at high Lorentz boost 
are contained within single reconstructed jets, are identified using jet substructure 
techniques, including the tagging of b hadrons. This is the first search for heavy res¬ 
onances decaying into HW or HZ resulting in an all-jet final state, as well as the first 
application of jet substructure techniques to identify H —> WW* —r 4q decays at high 
Lorentz boost. No significant signal is observed and limits are set at 95% confidence 
level on the production cross sections of W' and Zl in a model with mass-degenerate 
charged and neutral spin-1 resonances. Resonance masses are excluded for W 7 in 
the interval [1.0, 1.6] TeV, for Zl in the intervals [1.0, 1.1] and [1.3, 1.5] TeV, and for 
mass-degenerate W' and Zl in the interval [1.0,1.7] TeV. 
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1 Introduction 

Several theories of physics beyond the standard model (SM) predict the existence of vector reso¬ 
nances with masses above 1 TeV that decay into a W or Z vector boson (V) and a SM-like Higgs 
boson (H). Here we present a search for the production of such resonances in proton-proton 
(pp) collisions at a centre-of-mass energy of = 8 TeV. The data sample, corresponding to an 
integrated luminosity of 19.7 fb 1 , was collected with the CMS detector at the CERN LHC. 

The composite Higgs (lH3l and little Higgs models 1IH<3 address the hierarchy problem and 
predict many new particles, including additional gauge bosons, e.g. heavy spin-1 W' or Z' 
bosons (V')- These models can be generalized in the heavy vector triplet (HVT) framework UTJ. 
Of particular interest for this search is the HVT scenario B model, where the branching frac¬ 
tions B( W' —> WH) and B(Zl —> ZH) dominate over the corresponding branching fractions 
to fermions, and are comparable to B (W' —> WZ) and B(Zl —> WW). In this scenario, experi¬ 
mental constraints from searches for boson decay channels are more stringent than those from 
fermion decay channels. Several searches B8til2l for W' —> WZ based upon the Extended Gauge 
Boson (EGB) reference model Ifl3t have excluded resonance masses below 1.7 TeV. Unlike the 
HVT scenario B model, the EGB model has enhanced fermionic couplings and the mass limit 
is not directly comparable to this work. Model independent limits on the cross section for the 
resonant production £v + jets Ifl4l can be used to extract resonance mass limits on the processes 
W' —> WZ and Z' —» WW of 1.7 TeV and 1.1 TeV, respectively. A search for Z ' —r ZH —> qqrr 
was reported in Ref. I1T51 and interpreted in the context of HVT scenario model B; however, no 
resonance mass limit could be set with the sensitivity achieved. Finally, a recent search Ifl6l 
combining leptonic decays of W and Z bosons, and two b-tagged jets forming a H —> bb can¬ 
didate excluded HVT model A with coupling constant gv = 1 for heavy vector boson masses 
below »z v /o < 1360 GeV and m V '± < 1470 GeV. 

The signal of interest is a narrow heavy vector resonance V' decaying into VH, where the V 
decays to a pair of quarks and the H decays either to a pair of b quarks, or to a pair of W bosons, 
which further decay into quarks. The H in the HVT framework does not have properties that 
are identical to those of a SM Higgs boson. We make the assumption that the state observed by 
the LHC Collaborations 1X71 [181 is the same as the one described by the HVT framework and 
that, in accord with present measurements Itl9l 20], its properties are similar to those of a SM 
Higgs boson. 

In the decay of massive V' bosons produced in the pp collisions at the LHC, the momenta of 
the daughter V and H are large enough (>200 GeV) that their hadronic decay products are 
reconstructed as single jets 112X1 . Because this results in a dijet topology, traditional analysis 
techniques relying on resolved jets are no longer applicable. The signal is characterized by a 
peak in the dijet invariant mass (ffljj) distribution over a continuous background from mainly 
QCD multijet events. The sensitivity to b-quark jets from H decays is enhanced through subjet 
or jet b tagging l22l . Jets from W/Z —> qq', H —> bb, and H —>■ WW* —> 4q decays are identified 
with jet substructure techniques 1231124] , 

This is the first search for heavy resonances decaying via VH into all-jet final states and it 
incorporates the first application of jet substructure techniques to identify H —> WW* —> 4q at 
a high Lorentz boost. 
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4 Event reconstruction and selection 


2 The CMS detector 

The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal diam¬ 
eter, providing a magnetic field of 3.8 T. Within the field volume are a silicon pixel and strip 
tracker, a lead tungstate crystal electromagnetic calorimeter, and a brass and scintillator hadron 
calorimeter, each composed of a barrel and two endcap sections. Muons are measured in gas- 
ionization detectors embedded in the steel flux-return yoke outside the solenoid. Extensive 
forward calorimetry complements the coverage provided by the barrel and endcap detectors. 
A more detailed description of the CMS detector, together with a definition of the coordinate 
system used and the relevant kinematic variables, can be found in Ref. |25| . 


3 Signal model and simulation 

In the HVT framework, the production cross sections of W' and Z! bosons and their decay 
branching fractions depend on three parameters in addition to the resonance masses: the 
strength of couplings to quarks (c q ), to the H (ch), and on their self-coupling (gy). I n the 
HVT model B, where gv = 3 and c q = — Ch = 1, W' and Zl preferentially couple to bosons 
(W/Z/H), giving rise to diboson final states. This feature reproduces the properties of the W' 
and Z! bosons predicted by the minimal composite Higgs model. In this case, the production 
cross sections for Z', W' , and W /+ are respectively 165, 87, and 248 fb for a signal of reso¬ 
nance mass niyi = 1 TeV. Their branching fractions to VH and decay width are respectively 
51.7%, 50.8%, 50.8% and 35.0, 34.9, 34.9 GeV. The resonances are assumed to be narrow, i.e., 
with natural widths smaller than the experimental resolution in m ;j for masses considered in 
this analysis. 

We consider the W' and Z' resonances separately, and report limits for each candidate individ¬ 
ually to permit the reinterpretation of our results in different scenarios with different numbers 
of spin-1 resonances. 

Signal events are simulated using the MadGraph 5.1.5.11 [j2'6| Monte Carlo event generator to 
generate partons that are then showered with PYTHIA 6.426 11271 to produce final state particles. 
These events are then processed through a Geant4 |28t based simulation of the CMS detector. 
The MadGraph input parameters are provided in Ref. Ii29l and the H mass is assumed to 
be 125 GeV. Samples showered with HERWIG++ 2.5.0 Il30ll are used to evaluate the systematic 
uncertainty associated with the hadronization. Tune Z2* 1311 is used in PYTHIA, while the 
version 23 tune f30| is used in HERWIG++. The CTEQ6L1 Il32l parton distribution functions 
(PDF) are used for MadGraph, pythia and HERWIG++. Signal events are generated from 
resonance mass 1.0 to 2.6 TeV in steps of 0.1 TeV. Signals with resonance masses between the 
generated values are interpolated. 

The distribution of the background is modelled from the data. However, simulated samples of 
multijet and tt events, generated using MadGraph 5vl.3.30 fl26] and POWHEG 1.0 133H35I , re¬ 
spectively, and interfaced to PYTHIA for parton showering and hadronization, serve to provide 
guidance and cross-checks. 

4 Event reconstruction and selection 

The event selection, in the online trigger as well as offline, utilizes a global event description 
by combining information from the individual subdetectors. Online, events are selected by at 
least one of two specific triggers: one based on the scalar sum of the transverse momenta pj of 



3 


the jets (Hj), which requires Hj > 650 GeV; the other on the invariant mass of the two jets with 
highest pj, which requires mjj > 750 GeV. 

The offline reconstruction is described below. 

Events must have at least one primary vertex reconstructed with \z\ < 24 cm. The primary 
vertex used in the event reconstruction is the one with the largest summed pj of associated 
tracks. Individual particle candidates are reconstructed and identified using the particle-flow 
algorithm [[361 1371, and divided into five categories: muons, electrons, photons (including those 
that convert into e ' e pairs), charged hadrons, and neutral hadrons. Charged particle candi¬ 
dates associated with a primary vertex different from the one considered for the event recon¬ 
struction are discarded, which reduces contamination from additional pp interactions in the 
same bunch crossings (pileup). 

Jets are clustered from the remaining particle flow candidates, except those identified as iso¬ 
lated muons, using the Cambridge-Aachen (CA) If38l |391 jet clustering algorithm as imple¬ 
mented in FASTjET Phi [4111. This algorithm starts from a set of particles as "protojets". It 
combines them iteratively with each other into new protojets until the distance of the result¬ 
ing protojet to the closest remaining protojet is larger than the distance parameter of the CA 
algorithm. A distance parameter of 0.8 is used (CA8 jets). An event-by-event correction based 
on the jet area method BEUBH is applied to remove the remaining energy deposited by neutral 
particles originating from pileup. The pileup-subtracted jet four-momenta are then corrected 
to account for the difference between the measured and true energies of hadrons Il44l . Jet iden¬ 
tification criteria |45l are applied to the two highest pj jets in order to remove spurious events 
associated with calorimeter noise. 

The jet reconstruction efficiencies (estimated from simulation) are larger than 99.9%, and con¬ 
tribute negligibly to the systematic uncertainties for signal events. 

Events are selected by requiring at least two jets each with pj > 30 GeV/c and pseudorapidity 
| ?/1 < 2.5. The two highest pj jets are required to have a pseudorapidity separation |A//| < 1.3 
to reduce background from multijet events l46l . The invariant mass of these two jets is re¬ 
quired to satisfy > 890 GeV/c 2 . The trigger efficiency for the events passing the preselection 
requirements exceeds 99%. 

To enable the results to be applied to other models of similar final states, we utilize simulations 
to derive the geometrical acceptances and the W/Z and H selection efficiencies. These are 
presented separately in Figs. [TJ [6j and[7j respectively. 

For the purpose of reinterpreting the result, the global efficiency is presented approximated 
by the product of acceptances and the W/Z and H selection efficiency, restricted to final states 
where the W/Z and H bosons decay hadronically. The products of acceptances and the W/Z 
and H tagging efficiency, ignoring the correlations between detector acceptance and W/Z or H 
tagging, agree to better than 10% with the full event simulation. In the interpretations reported 
in this paper, the global efficiency is estimated from the full simulation of signal events, such 
that the correlations between the acceptance and W/Z and H selection efficiency are properly 
taken into account. However, when re-interpreting this search in terms of an arbitrary model, 
an additional uncertainty of 10% should be folded in, to allow for the possible effect of correla¬ 
tions. 

The acceptance, shown in Fig. [l] as a function of the dijet resonance mass for several signals, 
takes into account the angular acceptance (1 1 ) \ < 2.5, |A;/1 < 1.3). 

The two highest pj jets are chosen as candidates for the hadronically decaying W/Z and H 
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4 Event reconstruction and selection 



Resonance mass (TeV) 


Figure 1: The fraction of simulated signal events for hadronically decaying W/Z and H bosons, 
reconstructed as two jets, that pass the geometrical acceptance criteria (|f/| < 2.5, | A;/1 < 1.3), 
shown as a function of the resonance mass. 

bosons, and W/Z and H tagging algorithms based on jet substructure are applied. 

Information characterizing jet substructure is derived using three separate algorithms, produc¬ 
ing the variables primed jet mass, subjet b tagging, and N-subjettiness. The combined use of these 
variables in event selection strongly suppresses the background from QCD dijet production. 
All three characterizations of jet substructure are defined and discussed in detail in the follow¬ 
ing paragraphs. 

As the mass of the V or H boson is larger than the mass of a typical QCD jet, the jet mass is 
the primary observable that distinguishes such a jet from a QCD jet. The bulk of the V or H 
jet mass arises from the kinematics of the two or more jet cores that correspond to the decay 
quarks. In contrast, the QCD jet mass arises mostly from soft gluon radiation. For this reason, 
the use of jet pruning Il47ij48| improves discrimination by removing the softer radiation, as this 
shifts the jet mass of QCD jets to smaller values, while maintaining the jet mass for V and H jets 
close to the masses of W, Z or H bosons. Jet pruning is implemented by applying additional 
cuts in the process of CA jet clustering. These cuts remove protojets that would have a large 
angle and low pj with respect to the combination with another protojet. The details of this 
procedure are given in Ref. |24| . The distributions of the pruned jet mass (mj) for simulated 
signal and background samples, are shown in Fig. [2] Jets from boosted W and Z decays are 
expected to generate peaks at mj ~ 80 and mj ~ 90 GeV, respectively. Jets from boosted H 
decays are expected to peak at mj ~ 120 GeV. Fladronic top-quark jets, where the b quark and 
the two different light quarks from the t —> Wb —> qq'b decay are required to be within a 
reconstructed CA8 jet, peak at ??Zj ~ 175 GeV. The peak around 20 GeV arises from unmerged 
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light jets, mostly associated with quark- and gluon-induced jets from multijet events, but also 
from quark jets from W, Z, and H bosons in the cases where the decay products do not end up 
in a single jet. The contribution from bosons depends on their spin and polarization. All peaks 
are slightly shifted to lower masses because of the removal of soft radiation in jet pruning. If 
the pruned jet has a mass (mj) within 70 < ?;zj < 100GeV/c 2 ( 110 < nv f < 135GeV/c 2 ), it is 
tagged as a W/Z ( H ) candidate. 



Jet mass m (GeV) 


Figure 2: Distribution of pruned jet mass in simulation of signal and background processes. 
All simulated distributions are normalized to 1. The W/Z, H, and top-quark jets are required 
to match respective generator level particles in the event. The W/Z and H jets are from 1.5 TeV 
W' —> WH and Z' —> ZH signal samples. 

Jet pruning can also provide a good delineation of subjets within the CA8 jet. 

To tag jets from H —> bb decays, denoted as H bb jets, the pruned subjets, given by reversing 
the last step of the CA8 pruning recombination algorithm, are used as the basis for b tagging. 
Jets arising from the hadronization of b quarks (b jets) are identified using the "Combined Sec¬ 
ondary Vertex" b-tagging algorithm l49l . which uses information from tracks and secondary 
vertices associated with jets to build a likelihood-based discriminator to distinguish between 
jets from b quarks and those from charm or light quarks and gluons. The b-tagging discrimi¬ 
nator can take values between 0 and 1 with higher values indicating higher probability for the 
jet to originate from a b quark. The "loose" working point of the b-tagging algorithm f49l is 
chosen and is found to be optimal for both subjet and jet b tagging. It has a b-tagging efficiency 
of ~85%, with mistagging probabilities of ~40% for c-quark jets and ~10% for light-quark and 
gluon jets at jet pj near 80 GeV. The ratio of b-tagging efficiencies for data and simulation is 
applied as a scale factor f22l to the simulated signal events. To identify CA8 jets originating 
from H —» bb decays, we apply b tagging either to the two subjets or to the CA8 jet, based 
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4 Event reconstruction and selection 


on the angular separation of the two subjets (A R) Il22l . If A R is larger (smaller) than 0.3, the 
b-tagging algorithm is applied to both of the subjets (the single CA 8 jet). 

While the pruned jet mass is a powerful discriminant against QCD multijet backgrounds, 
the substructure of jets arising from V and H decays provides additional discrimination. In 
H —> WW' —> 4q decays, the boosted H decays into a final state of four quarks merged to¬ 
gether, denoted as an Hww jet, and has a different substructure than jets from V/H —> qq' 
decays. We quantify how well the constituents of a given jet can be arranged into N subjets 
by reconstructing the full set of jet constituents (before pruning) with the kj algorithm |[50l and 
halting the reclustering when N distinguishable protojets are formed. The directions of the N 
jets are used as the reference axes to compute the N-subjettiness H5lT - l53l r,v of the original jet, 
defined as 

l 

"Tv = ~r Y] Pi,k min(A R lk , A R lk ,..., AR Nrk ), (1) 

do k 

where pj rk is the pj of the k th constituent of the original jet and AK„^ is its angular distance 
from the axis of the nth subjet (with n = 1,2,... ,N). The normalization factor do for t,\j is 
d 0 = Y,k PJ,kRo, with Ro set to 0.8, the distance parameter of the CA algorithm. To improve 
the discriminating power, we perform a one-pass optimization of the directions of the subjets' 
axes by minimizing T\j [|24l [521. By using the smallest A R nk to weight the value of fj k in 
Eq. §, Tn yields small values when the jet originates from the hadronization of N or fewer 
quarks. The Tp = z-J t ; ratios T 21 , T 31 , T 32 , T 41 , T 42 , and T 43 have been studied to identify the best 
discriminators for jets from W/Z —> qq' and H —> WW* —> 4q decays. 

We find that T 21 is the most suitable variable for identifying W/Z —> qq' jets Ifl2l . The distri¬ 
bution of T 21 for the W/Z —* qq' signal, shown in Fig. [3j peaks below 0.4 and is almost fully 
contained within T 21 < 0.75, where we place our cut. In contrast, the QCD background peaks 
around 0.6. The figure shows only W/Z candidate jets with the pruned jet mass in the W/Z bo¬ 
son mass window. For this reason, the jets matched to the top quark are mostly true W bosons, 
and appear signal-like. However, they represent only a small fraction of the top quarks from 
tt events (cf. Fig. [2J), since in the kinematic regime considered in this search, the top quarks are 
highly boosted and the b jet rarely fails to merge with the W jet. The overall contribution from 
tt, after the full selection, is 1-3%. 

For H —> WW* —> 4q events, we find that the ratio T 42 works best to discriminate between 
four-pronged H —> WW* —» 4q and QCD jets. The discriminating power of T 42 can be seen in 
The T 42 distribution of Hww jets tends to peak around 0.55. By contrast, T 42 distributions 
of multijet background and W/Z jets have a larger fraction of events at large values of T 42 , 
especially after requiring a pruned jet mass in the range [110, 135] GeV. Jets from unmatched 
tt events peak together with QCD jets, since they contain a mixture of b-quark jets and W-jets, 
but relatively few fully merged top-quark jets. However, the T 42 distribution for matched top- 
quark jets tends to peak at smaller values, since for the same jet T 42 is nearly always less than 
T 32 , which is small for hadronic top-quark jets. 

In Fig. |4j the comparison between dijet data and the QCD multijet simulation shows that the 
simulated distribution is well reproduced, though shifted towards higher values of T 42 as com¬ 
pared with the data. A similar level of disagreement is known for the modelling of T 21 in QCD 
simulation in Ref. m. The disagreement does not affect this analysis since the background is 
estimated from data. For the signal scale factor, the uncertainties from the modelling of T 42 are 
taken into account. 

We select "high (low)-purity" W/Z jets by requiring T 21 < 0.5 (0.5 < T 21 < 0.75), denoted as the 
HP (LP) V tag. Given the shape of T 21 distribution for the W/Z signal, the HP V tag category 
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Figure 3: Distribution of the N-subjettiness ratio T 21 = T 2 /Ti, where T,v is given in Eq. {Ij, for 
simulated signal and background processes, and for data. The jets for which T 21 is calculated 
are required to satisfy the W/Z pruned jet mass requirement. The W/Z and top-quark jets are 
required to match respective generator level particles in the event. All simulated distributions 
are scaled to the number of events in data. 

has a higher efficiency than the LP V tag category. We select HP (LP) Hww jets by requiring 
T 42 < 0.55 (0.55 < T 42 < 0.65), denoted as the HP (LP) H tag. Here also the HP category has a 
higher efficiency than the LP category. 

Cross-talk between the H decay channels is possible; for example, two-pronged H decays (e.g. 
H —» bb, H —> cc) can be reconstructed as four-pronged H —» WW* —> 4q, as shown in 
Fig. 0 Because of its large branching fraction, H —* bb contributes a non-negligible number 
of events to the H —» WW* —> 4q tagged sample. In order to combine events from H —> bb 
and H —» WW* —> 4q channels into a single joint likelihood, these categories must be mutually 
exclusive. Since the H —» bb tagger has significantly lower background than H —> WW* —» 4q, 
it takes precedence in selecting events. We first identify the events that pass the H —> bb tagger, 
and only if they fail we test them for the presence of the H —> WW* —> 4q tag. Thus we arrive 
at the final division of events into five mutually exclusive categories. These event categories 
and their nomenclature are summarized in Table [Q 

The LP V tag and LP H tag category is not included in this analysis, since it is dominated by 
background and therefore its contribution to the expected significance of the signal is negligi¬ 
ble. Other H decay modes like H —> gg, H —> tt, H —> ZZ*, and H —> cc together contribute 
2-7% of the total H —> bb tagged events, and 18-24% of the total H —> WW* —> 4q tagged 
events, as shown in Fig. [5j In this analysis, we only consider the H —* bb and H —> WW* —> 4q 
channels. Other H channels passing the tagging requirements are conservatively viewed as 
background and included as systematic uncertainties, discussed in Section [ 6 ] 
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Figure 4: Distributions of T 42 in data and in simulations of signal (2TeV) and background 
events, without applying the pruned jet mass requirement (left) and with the pruned jet mass 
requirement applied (right). Matched top-quark, W/Z, and Hww jets are required to be con¬ 
sistent with their generator level particles, respectively. All simulated distributions are scaled 
to the number of events in data, except that matched top-quark background is scaled to the 
fraction of unmatched tt events times the number of data events. 

Table 1: Summary of event categories and their nomenclature used in the paper. The jet mass 
cut is 70 < nij < 100GeV/c 2 for the V tag and 110 < m,- < 135GeV/c 2 for the H tag. 


Categories 

V tag 

H tag 

V HP H bb 

r 2 i < 0.5 

b tag 

V LP H bb 

0.5 < T 21 < 0.75 

b tag 

V HP H^ 

r 2 i < 0.5 

T 42 < 0.55 

v lp h hp 
v n ww 

0.5 < T 21 < 0.75 

T 42 < 0.55 

v hp h lp 

v n ww 

r 2 i < 0.5 

0.55 < T 42 < 0.65 


The expected tag probabilities of the W, Z, and H selection criteria for signal and data events 
in different event categories are shown in Figs. [ 6 ] and [7j as a function of mjj. The W/Z and 
H —> WW* —> 4q tagging efficiencies for signal events in the H —> WW" —> 4q categories fall 
at high px, primarily because the T 42 distribution is px-dependent. 

The Monte Carlo modelling of V-tag efficiency is validated using high-px W —> qq' decays 
selected from a data sample enriched in semileptonic tt events If24| . Scale factors of 0.86 ± 
0.07 and 1.39 ± 0.75 are applied to the simulated events in the HP and LP V tag categories, 
respectively, to match the tagging efficiencies in the top pair data. The decay of H —> WW* —> 
4q produces a hard W jet accompanied by two soft jets from the off-shell W boson. As the 
H —> WW* —> 4q tagger is also based on the N-subjettiness variables, and the measured ratio 
T 42 / T 21 is well modelled by QCD simulation, it is reasonable to assume that the mismodelling 
of the shower by PYTHIA is similar to that in the case of V tagging. The H —> WW* —> 4q 
tagging efficiency scale factors are extrapolated using the same technique as for V tagging for 
both the HP and LP categories, respectively, with additional systematic uncertainties, which 
are discussed in Section [ 6 ] 
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Figure 5: Comparison of T 42 distributions for signal events failing the H —» bb requirement. 
These events are from the H —> WW* —> 4q, H —> bb, H —> gg, H —> cc, and H —> rr 

channels. The H jets are from a 1.5 TeV resonance decaying to VH. All curves are normalized 
to the product of the corresponding branching fraction and acceptance. 




Dijet invariant mass m N (TeV) 


Dijet invariant mass m N (TeV) 


Figure 6 : Tagged fractions in H —> bb,W/Z —> qq' signal channels and data as a function 
of dijet invariant mass, for categories of V HP Hbb (left) and V LP Hbb (right). Horizontal bars 
through the data points indicate the bin width. 


5 Resonance search in the dijet mass spectrum 

The resolution for the reconstruction is in the range 5-10% for all the five categories. The 
dominant background in this analysis is from multijet events with an additional 1-3% contri¬ 
bution from tt events. The background is modelled by a smoothly falling distribution for each 
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5 Resonance search in the dijet mass spectrum 




Figure 7: Tagged fractions in H —» WW* —> 4q,W/Z —» qq' signal channels and data as a 
function of dijet invariant mass, for categories of V hp H^ v (top), V hp H{^ v (bottom left) and 
V l p H[ p ’ v (bottom right). Horizontal bars through the data points indicate the bin width. 


event category, given by the empirical probability density function 


P D (wZjj) 


P 0 (l -??Zjj/Vs) Pl 

( m jj / v®) Pz 


( 2 ) 


The background model includes the small tt background, which falls smoothly in a similar way 
to the multijet background. 

Each event category has separate normalization Pq and shape parameters Pi and P 2 . This pa¬ 
rameterization was deployed successfully in a number of searches based on dijet mass spec¬ 
tra Il46l . A Fisher F-test If54l is used to check that no additional parameters are needed to 
model the individual background distributions, compared with the four-parameter function 
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used in Ii46l . We have also tested an alternative function /V (?ftjj ) = Pq/ fzftjj/ V s + Pi) Fl , and 
found it less favored by the F-test. 

The use of the alternative function in the analysis produces negligible changes in the final result 
and therefore, no systematic uncertainty is associated with this choice. 

We search for a peak on top of the falling background spectrum by means of a binned maximum 
likelihood fit to the data. 


The binned likelihood is given by 


„ A" ! e _Ai 

*=n 1 


n l \ 


(3) 


where A,- = //N;(S) + N,(B), }t is a scale factor for the signal, N) ( S ) is the number of events 
expected from the signal, and N,-(B) is the number expected from multijet background. The 
variable ft,- quantifies the number of observed events in the z th nu, bin. The number of back¬ 
ground events N,-(B) is described by the functional form of Eq. 0. The signal shape for each 
narrow-width resonance hypothesis is obtained by fitting the 7ftjj distribution from simulated 
events with a sum of a Gaussian and a Crystal Ball probability density function. The resulting 
shape is fixed and, as such, used in the combined signal and background fit. This procedure 
is repeated for each resonance hypothesis, sampling resonance masses from 1.0 to 2.6 TeV in 
steps of 50 GeV. While maximizing the likelihood, and the parameters of the background 
function are left unconstrained. The shape of the resonance is additionally modified to account 
for systematic uncertainties (described below); parameters controlling each source of system¬ 
atic uncertainty are also allowed to vary in the fit, albeit within constraints. For presentational 
purposes, a binning according to ftZjj resolution is used in this paper. However, the likelihood is 
calculated in bins of 1 GeV in mjj, approximating an unbinned analysis, while keeping it com¬ 
putationally manageable. 

Figs. [8] and |9j show the nij j distributions in data. The solid curves represent the results of 
the maximum likelihood fit to the data, fixing the number of expected signal events to zero, 
while the bottom panels show the corresponding pull distributions, quantifying the agree¬ 
ment between the background-only hypothesis and the data. The expected distributions of 
H —y bb, W/Z —> qq' and H —» WW* —> 4q, W/Z —> qq' signals at 1.0,1.5 and 2.0 TeV in each 
category, scaled to their corresponding cross sections are given by the dashed and dash-dotted 
curves. The resonance masses in VHbb channels are slightly lower than those of the VH WW 
channels because of missing neutrinos in b-hadron decays and partial misreconstruction of 
two-pronged H —» bb decays. 
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5 Resonance search in the dijet mass spectrum 




Figure 8: Distributions in mjj are shown for V HP Hbb category (left), V LP H bb category (right). 
The solid curves represent the results of fitting Eq. (|2]l to the data. The distributions for H —* 
bb,W/Z —> qq' contributions, scaled to their corresponding cross sections, are given by the 
dashed curves. The vertical axis displays the number of events per bin, divided by the bin 
width. Horizontal bars through the data points indicate the bin width. The corresponding pull 
distributions p ^~ Flt , where (r Data represents the statistical uncertainty in the data in a bin in 
mjj, are shown below each mv, plot. 
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Figure 9: Distributions in m jj are shown for V H[ ’H^, v (top), V hp H{^ v (bottom left), and 
V LP H^ ; P v (bottom right). The solid curves represent the results of fitting Eq. to the data. The 
distributions for H —> WW* —> 4q,W/Z —> qq'contributions, scaled to their corresponding 
cross sections, are given by the dashed and dash-dotted curves. The vertical axis displays the 
number of events per bin, divided by the bin width. Horizontal bars through the data points 
indicate the bin width. The corresponding pull distributions p ^~ Flt , where cr Data represents 
the statistical uncertainty in the data in a bin in m y, are shown below each plot. 
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6 Systematic uncertainties 


6 Systematic uncertainties 

The largest contributions to the systematic uncertainty are associated with the modelling of the 
signal, namely: the efficiencies of W/Z, H, and b tagging; the choice of PDF; the jet energy scale 
(JES); the jet energy resolution (JER); the pileup corrections; the cross-talk between different 
signal contributions; and the integrated luminosity 

The uncertainty in the efficiency for W/Z tagging is estimated using a control sample enriched 
with tt events described in Ref. l!24l . Uncertainties of 7.5% and 54% in the respective scale fac¬ 
tors for HP and LP V tag include contributions from control-sample statistical uncertainties, 
and the uncertainties in the JES and JER for pruned jets Ifl2l . The uncertainty due to the ex¬ 
trapolation of the simulated W/Z-tagging efficiency to higher jet pj is estimated by studying 
the W/ Z-tagging efficiency as a function of pj for two different showering and hadronization 
models using pythia 6 and HERWIG++, respectively. The results show that the differences are 
within 4% (12%) for the HP (LP) V tagging Il24l . 

We extrapolate the H —» WW* —> 4q tagging efficiency scale factor in the same way as the W/Z- 
tagging efficiency, with an additional systematic uncertainty based on the difference between 
PYTHIA 6 and HERWIG++ in modelling H —» WW* —» 4q decay. This is evaluated to be ss7% 
for the HP and LP H tag. The uncertainty from the pruned jet mass requirement in the H —> 
WW* —» 4q search is already included in the extrapolated scale factor uncertainty of the V-tag. 

The uncertainty in the efficiency of H —> bb tagging can be separated into two categories: the 
efficiency related to the b tagging and the efficiency related to the pruned H mass tag. The 
first is obtained by varying the b-tagging scale factors within the associated uncertainties l22l 
and amounts to 15%. The second is assumed to be similar to the mass selection efficiency of W 
jets estimated in Ref. If24l . additionally accounting for the difference in fragmentation of light 
quarks and b quarks, which amounts to 2.6% per jet. 

Because of the rejection of charged particles not originating from the primary vertex, and the 
application of pruning, the dependence of the W/Z- and H-tagging efficiencies on pileup is 
weak and the uncertainty in the modelling of the pileup distribution is <1.5% per jet. 

In this analysis, we only consider H —> bb and H —t WW* —t 4q decays. Other H decay 
channels that pass H taggers are viewed as nuisance signals, and a corresponding cross-talk 
systematic uncertainty is assigned. We evaluate this uncertainty as a ratio of expected nuisance 
signal events with respect to the total expected signal events, taking into account the branching 
fractions, acceptances and tagging efficiencies. The contamination from cross-talk is estimated 
to be 2-7% in the VH bb categories, and 18-24% in the VHww categories, and we take the max¬ 
imum as the uncertainty. The analysis is potentially 7% (24%) more sensitive than quoted, but 
since it is not clear how well the efficiency for the nuisance signals is understood, they are ne¬ 
glected, yielding a conservative limit on new physics. When the VH b b and VHww categories 
are combined together, the 24% uncertainty becomes a small effect, based on a quantitative 
measure of sensitivity suggested in Ref. H55| : 

r g(H^XX) £s 
1 + 

where £>(H —> XX) is the branching fraction for the H decay channel, eg is the signal tagging 
efficiency, and Nb is the corresponding background yield. The values of P for each channel are 
shown in Table H) 

The JES has an uncertainty of 1-2% |44] [56j, and its pj and rj dependence is propagated to the 
reconstructed value of mjj, yielding an uncertainty of 1%, independent of the resonance mass. 
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Table 2: Summary of the values P for a Z' signal at 1.5 TeV resonance mass and the correspond¬ 
ing background yield in all five categories. 

Signal/Categories V HP H bb V LP H bb V HF H^ v V LF H^ 

H —> bb, Z —> qq 2.3 xl0 ~ 2 4.8 xl0 ~ 3 1.0 xl0 “ 3 1.6 xl0 “ 3 3.9 xl0 “ 4 

H ->■ WW* 4q,Z ->■ qq 5.6 xl0 ~ 4 «0 2.6 xl0 “ 3 9.8 xl0 “ 4 4.5 xl0 “ 4 

The impact of this uncertainty on the calculated limits is estimated by changing the dijet mass 
in the analysis within its uncertainty. The JER is known to a precision of 10%, and its non- 
Gaussian features observed in data are well described by the CMS simulation Il44l . The effect 
of the JER uncertainty on the limits is estimated by changing the reconstructed resonance width 
within its uncertainty. The integrated luminosity has an uncertainty of 2.6% [57], which is also 
taken into account in the analysis. 

The uncertainty related to the PDF used to model the signal acceptance is estimated from the 
CT10 [58] , MSTW08 [59] , and NNPDF21 [60] PDF sets. The envelope of the upward and down¬ 
ward variations of the estimated acceptance for the three sets is assigned as uncertainty [61] 
and found to be 5-15% in the resonance mass range of interest. A summary of all systematic 
uncertainties is given in Table [3] and [4j Among these uncertainties, the JES and JER are applied 
as shape uncertainties, while others are applied as uncertainty in the event yield. 


Table 3: Systematic uncertainties common to all categories. 


Source 

HP uncertainties (%) 

LP uncertainties (%) 

JES 

1 

1 

JER 

10 

10 

Pileup 

<3.0 

<3.0 

PDF 

5-15 

5-15 

Integrated luminosity 

2.6 

2.6 

W tagging 

7.5 

54 

W tag pi dependence 

4 
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Table 4: Systematic uncertainties(%) for X —> VH signals, in which H —* bb and H —» WW* —* 
4q. Numbers in parentheses represent the uncertainty for the corresponding LP category. If LP 
has the same uncertainty as HP, only the HP uncertainty is presented here. 


Final state 


Source 

H —> bb 
VH bb VHww 

H ->■ WW* ->■ 4q 

VHww 

H —>■ bb mass scale 

2.6 

— 

— 

H(4q) tagging 

— 

7.5 (54) 

7.5 (54) 

H(4q)-tag T 42 extrapolation 

— 

7 

7 

Cross-talk 

7 

24 

24 

b tagging 

< 15 

< 15 

— 


7 Results 

The asymptotic approximation [62] of the LHC CL S criterion [6311641 is used to set upper limits 
on the cross section for resonance production. The dominant sources of systematic uncertain¬ 
ties are treated as nuisance parameters associated with log-normal priors in those variables. For 
a given value of the signal cross section, the nuisance parameters are fixed to the values that 
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8 Summary 


maximize the likelihood, a method referred to as profiling. The dependence of the likelihood 
on parameters used to describe the background in Eq. (|2j is treated in the same manner, and no 
additional systematic uncertainty is assigned to the parameterization of the background. 


Events from the 5 categories of Tableware combined into a common likelihood, with the uncer¬ 
tainties of the HP and LP H tag (V tag) efficiencies considered to be anticorrelated between HP 
and LP tagging because events failing the HP T 42 (T 21 ) selection migrate to the LP category and 
the fraction of events failing both HP and LP requirements is small compared to the HP and 
LP events. The branching fractions of H —> WW* —» 4q and H —» bb decays are taken as fixed 
values in joint likelihood. The remaining systematic uncertainties in the signal are fully corre¬ 
lated across all channels. The variables describing the background uncertainties are treated as 
uncorrelated. Pig. 10 shows the observed and background-only expected upper limits on the 
production cross sections for Zl and W', including both H —> bb and H —» WW* —» 4q decays, 
computed at 95% confidence level (CL), with the predicted cross sections for the benchmark 
models overlaid for comparison. In the HVT model scenario B, W' and Z' are degenerate in 
resonance mass, thus we compute the limit on their combined cross section under this hypoth¬ 
esis, shown in Fig. [TT] Table [5] shows the exclusion ranges on resonance masses. 


Table 5: Summary of observed lower limits on resonance masses at 95% CL and their expected 
values, assuming a null hypothesis. The analysis is sensitive to resonances heavier than 1 TeV. 


Process 

Observed 

lower mass limit (TeV) 

Expected 

lower mass limit (TeV) 

W' - 

-> HW 

[ 1 . 0 , 1 . 6 ] 

1.7 

Zl - 

-> HZ 

[1.0,1.1], [1.3,1.5] 

1.3 

V' - 

■» VH 

[1.0,1.7] 

1.9 


8 Summary 

A search for a massive resonance decaying into a standard model-like Higgs boson and a W or 
Z boson is presented. A data sample corresponding to an integrated luminosity of 19.7 fb 1 col¬ 
lected in proton-proton collisions at y/s = 8 TeV with the CMS detector has been used to mea¬ 
sure the W/Z and Higgs boson-tagged dijet mass spectra using the two highest pj jets within 
the pseudorapidity range |//| < 2.5 and with pseudorapidity separation |A;/| < 1.3. The QCD 
background is suppressed using jet substructure tagging techniques, which identify boosted 
bosons decaying into hadrons. In particular, the mass of pruned jets and the N-subjettiness 
ratios T 21 and T 42 , as well as b tagging applied to the subjets of the Higgs boson jet, are used 
to discriminate against the otherwise overwhelming QCD background. The remaining QCD 
background is estimated from a fit to the dijet mass distributions using a smooth function. We 
have searched for the signal as a peak on top of the smoothly falling QCD background. No 
significant signal is observed. In the HVT model B, a Zl is excluded in resonance mass inter¬ 
vals [1.0, 1.1] and [1.3, 1.5] TeV, while a W' is excluded in the interval [1.0, 1.6] TeV. A mass 
degenerate W' plus Zl particle is excluded in the interval [1.0,1.7] TeV. 

This is the first search for heavy resonances decaying into a Higgs boson and a vector boson 
(W/Z) resulting in a hadronic final state, as well as the first application of jet substructure 
techniques to identify H —> WW* —» 4q decays of the Higgs boson at high Lorentz boost. 
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Resonance mass (TeV) 


Resonance mass (TeV) 


Figure 10: Expected and observed upper limits on the production cross sections for Z' —» HZ 
(left) and W' —> HW (right), including all five decay categories. Branching fractions of H and 
V decays have been taken into account. The theoretical predictions of the HVT model scenario 
B are also shown. 



Figure 11: Expected and observed upper limits on the production cross section for V' —> VH, 
obtained by combining W' and Z' channels together. Branching fractions of H and V decays 
have been taken into account. The theoretical prediction of the HVT model scenario B is also 
shown. 
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